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The present invention relates to a novel gene coding 
human epidermal growth factor ("hEGF") and a process 
for preparing the same employing a recombinant expression 
vector therefor. The hEGF gene of the invention is designed 
to contain codons ubiquitous in E. cdi and die following 
restriction sites: Hpal at the 5' terminal, Pstlat the 3' texminal 
and Bpull02L Nsil, Mlul, Eco47m and AM at a regular 
manner within its internal sequence. The present invention 
also provides a process for preparing REGF by employing 
an expression vector pTE105 for hEGF, which contains 
expression cassette comprising Omp A leader sequence, 
translation termination sequence and transcription termination 
sequence and hEGF gene; and, replication origin of pUC19, 
tetracycline- resistant marker and a par site for stabilization 
in E. coli. The hEGF is produced massively in E. coli 
transformed with the pTE105 (KCCM 10027). 
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A NOVEL GENE CODING HUMAN EPIDERMAL GROWTH FACTOR 
AND PROCESS FOR PREPARING THE SAME 

5 BACKGROUND OF THE T NVENTTON 

Field of the TnvpnHnn 

The present invention relates to a novel gene coding 
10 human epidermal growth factor and a process for preparing 
the same, more specifically, to a novel human epidermal 
growth factor gene and a process for preparing the same 
employing a recombinant expression vector therefor. 

15 Description of the Prior Art 

It has been known that human epidermal growth factor 
(hereinafter referred to as 'hEGF') is a polypeptide hormone 
consisted of 53 amino acids and 3 disulfide bridges r see : 

20 Cohen, S., J. Biol. Chem. , 237:1555-1562(1962) ; Savage, 
C.R. , Jr. et al., J. Biol. Chem., 248:7669-7672(1973); 
Savage, C.R., Jr. et al., J. Biol. Chem., 247:7612-7621 
(1972)], and plays an important role on the growth control 
in mammalian cells, iirfcgs; alia epidermal and epithelial 

25 cells on molecular level [sss: Sporn, M.B. et al., Nature 
(London), 313:745-747(1985); Sporn, M.B. et al., N. Engl. J. 
Med., 303:878-880(1980)] and the treatment of injury as well 
[3££: Buckley, A. et al., Proc. Natl. Acad. Sci., USA, 82: 
7340-7344(1985)]. Further, it has been reported that the 

30 hEGF can be applied in the treatment of a stomach ulcer, 
grounded on the facts that it represses secretion of gastric 
acid into stomach [see: Gregory, H., J. cell Sci. Suppl., 3: 
11-17(1985)]. 

Under the circumstances, studies on the mass produc- 
35 tion of the hEGF has been actively carried out, since 
Starkey et al. had reported the biochemical property of hEGF 
purified from human urine[£ee: starkey, R. H. et al., 
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Science, 189:800(1975); Cohen, S. et al., Proc. Natl. Acad. 
Sci., OSA, 72:1317(1975)]; and, several researchers have 
accomplished cloning of hEGF gene by the recombinant DNA 
technology in a successful manner f see : Smith, J. et al., 
5 Nucleic Acids Res., 10:4467-4482(1982); Urdea, M. S . et 
al., Proc. Natl. Acad. Sci., USA, 80:7461-7465(1983); Oka, 
T. etal., Proc. Natl. Acad. Sci., USA, 82:7212-7216(1985)]. 
The prior art, however, have not provided the hEGF to the 
level of industrial application owing to its low activity 

10 and productivity. 

Accordingly, studies on the elevation of activity and 
productivity in hEGF manufacture have been carried out, 
which are primarily concentrated on the preparation of the 
nucleotide sequence of hEGF gene efficient for its massive 

15 production and the expression vector whose regulatory 
function is strengthened. 

SUMMARY OF TUB TMV^yppynp 

20 In accordance with the present invention, the 

inventors synthesized a novel hEGF gene and a novel 
expression vector therefor which expresses hEGF in a massive 
manner and developed a process for preparing hEGF therefrom. 
A primary object of the invention is, therefore, to 

25 provide a novel hEGF gene which is designed and chemically 
synthesized for the purpose of massive expression of hEGF in 
E. coli. 

Another object of the invention is to provide a novel 
process for preparing hEGF from a recombinant expression 
30 vector comprising said hEGF gene and regulatory sequence. 

BRIEF DESCRIPTION OF TWP. nRAWTMCg 

The above and the other objects and features of the 
35 present invention will become apparent from the following 
descriptions given in conjunction with the accompanying 
drawings, in which: 
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Figure 1 depicts sequence of designed h£6F gene of the 

invention; 

Figure 2A depicts 10 oligonucleotides synthesized; 
Figure 2B depicts assembly pattern of synthesized 10 
5 oligonucleotides of Figure 2A; 

Figure 3 depicts construction strategy of pUE118; 
Figure 4 is a photograph showing agarose gel electro- 
phoresis pattern of pUE118 carrying hEGF 
gene digested with restriction enzymes; 
10 Figure 5 depicts the Omp A leader-universal 

translation termination-Trp A 
transcription termination sequence; 
Figure 6 depicts construction strategy of pDT420; 
Figure 7 depicts construction strategy of pDE135; 
15 Figure 8 depicts construction strategy of pTC108; 

Figure 9 depicts construction strategy of pTC226; 
Figure 10 depicts construction strategy of pTE105; 
Figure 11A is a photograph showing SDS-PAGE pattern 
of hEGF expressed in E. coli JH101 
20 harboring pTE105; 

Figure 11B is a photograph showing the Western blot 
analysis of hEGF expressed in E. coli 
JM101 harboring pTE105; 
Figure 12 is a photograph showing SDS-PAGE pattern of 
25 WSGF expressed from E. coli JM101 

harboring pTE105 in each purification 
step; 

Figure 13 is a photograph showing SDS-PAGE pattern of 
purified hEGF from E. coli JM101 
30 harboring pTE105; 

Figure 14 is a chromatogram of purified hEGF by 

reverse phase HPLC; and, 
Figure 15 is a photograph showing isoelectric focusing 
analysis of purified hEGF from E. coli 
35 JM101 harboring pTE105. 
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DETATT.En INSCRIPTION OF THE INVRifTjr^ 

The inventors synthesized a hEGF gene designed to 
maximize expression of the gene which satisfies the 
5 following requirements: first, hEGF expressed therefrom 
shall be the same as natural huamn EGF in light of amino 
acid sequence and protein structure; secondly, nucleotide 
sequence shall be designed in consideration of the codons 
ubiquitous in E. coli; thirdly, the portion of secondary 

10 structure of mRNA transcribed therefrom shall be minimized; 
fourthly, many unique restriction sites shall be positioned 
as far as possible. 

in this connection, the hEGF gene of the invention is 
designed, based on the previously known hEGF sequence in the 

15 arttses: Gregory, H., Nature, 257.-325(1975)]. First of all, 
the inventors synthesized 10 complementary oligonucleotides 
by using automatic Nucleotide Synthesizer in accordance with 
solid-phase phosphate triester method r see: Narang, S. a.. 
Synthesis and Applications of DNA and RNA, Academic Press,' 

20 1987]; and, to obtain the full hEGF gene, shot-gun ligation 
method is employed. 

The hEGF of the present invention contains the 
following restriction sites: Hpal at the 5' terminal, Pstl 
at the 3' terminal and Bpull02I, Nsil, Mlul, Eco471II and 

25 Aflll within its internal sequence. Introduction of the 
hEGF gene to pUCIS which is a starting vector of the 
invention, produces the above 7 specific sites which exist 
as unique restriction sites in the recombinant vector. The 
fact that said unique sites are cleaved by their specific 

30 restriction enzymes and positioned at a regular interval, 
allows practical availability of the gene for mutagenesis 
induction, which assures high activity and stability of 
hEGF. Furtheron, the existence of H pal restriction site at 
5' terminal allows production of intact hEGF free of fusion 

35 protein from the expression vector therefor. 

The hEGF gene of the invention is inserted into pocis 
digested with Smal and Pstl, and piasmid thus obtained is 
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named pOEHS. E. coli JM109 is transformed with pUE118 
according to Hanahan's method r see: DNA Cloning Vol.1: A 
Practical Approach, IPX Press, 1985, pp 109-135], and this 
transf ormant is deposited with Korean Culture Collection of 
5 Microorganisms (KCCH) located in Department of Pood Engi- 
neering, College of Eng., Yonsei University, Sodaemun-gu, 
Seoul 120-749, Korea on April 9, 1993 in the name of E. coli 
DW/BT-2040(KCCM 10026). 

On the other hand, it has been well known that 

10 regulation of protein expression corresponding to growth 
pattern of microorganism is very important in massive 
production of protein. In this connection, tac promoter 
[figg: de Boer et al., DMA, 2:231-235(1983); Amann et al., 
Gene, 25:167-178(1983)] is introduced to the expression 

15 vector of the invention. Since the tac promoter has 
continuous two ribosome binding sites at the downstream r see: 
Shine and Dalgarao, Proc. Natl. Acad. Sci., us a, 71:1342 
(1974)], it initiates the translation of the hEGP gene 
efficiently. 

20 For the accurate and efficient expression and secre- 

tion of hEGF, the expression vector of the invention employs 
the following sequences : (l) Omp A leader sequencer see: von 
Gabain, A. et al., Proc. Matl. Acad. Sci., USA, 80:653-657 
(1983)]; (2) universal translation termination sequencer see: 

25 p. Singleton and D. Sainsbury, Dictionary of Microbiology 
and Molecular Biology, 2nd Ed., Wiley, 383, 1987]; and, (3) 
trp A transcription termination sequencer see: Christie, G. 
E. et al., Proc. Natl. Acad, sci., USA, 78:418(1981)]. 
Further, the expression vector contains par site for the 

30 stability in E. coli[ses: Austin and Abeles, J. Mol. Biol., 
169:373-387(1983)]. 

"Omp A leader-universal translation termination-trp A 
transcription termination sequence" is designed and synthe- 
sized to insert hEGP gene of the invention, which comprises 

35 restriction sites of BamHI at 5* terminal and Xbal at 3' 
terminal, and Nael and PstI between Omp A leader and univer- 
sal translation termination sequence. 
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On the other hand, it has been known that secretion of 
the expressed protein is decreased when ampicillin-resistant 
marker which codes B-lactamase, a secretory protein, is used 
to screen E. coli containing the vector comprising said 
marker, grounded on the fact that the two proteins, i.e., 
interested protein and B-lactamase, compete in the course of 
secretion [fiss: a. Oka, et al., J. Mol. Biol., 147:217 
(1981)]. Under the circumstance, tetracycline-resistant 
marker coding intracellular protein, which brings about high 
secretion of expressed protein by avoiding said competition, 
is employed in the invention instead of the ampicillin- 
resistant marker. As a result, it is clearly guaranteed 
that the expression vector of the invention expresses and 
secretes hEGF in high yield, and is stable in E. coli. 
15 A commercially available plasmid pDR540 is digested 

with Pvull and then ligated with Xbal linker; and, double 
digestion with Xbal and BamHX is carried out. 2.4kb of DHA 
fragment thus obtained is ligated with "Omp A leader- 
universal translation termination-Trp A transcription 
20 termination sequence"; and named pDT420. 

Double digestion of pDT420 with Nael and PstI produces 
a cleavage between Omp A leader and universal translation 
termination sequence; and, pDElia carrying hEGF gene of the 
invention is digested with Hpal and PstI to obtain hEGF 
25 gene. Then, the obtained hEGF gene is ligated with the 
PDT420 diqested with Nael and PstI, and the resultant is 
named pDE135. 

Plasmid pUC19 known in the art is digested with Dral 
and EcoRl to obtain 1.2kb DNA fragment whose two ends are 
blunt-ended and cohesive for EcoRI, respectively; and, 
PBR322 is digested with Aval and blunt-ended with Klenow's 
fragment, and digestion with EcoRI is followed to obtain 
1.4kb DNA fragment. The above 1.4kb and l.2kb DNA fragments 
are ligated with T 4 DNA ligase, and is named pTClos. As a 
35 result, pTClos comprises tetracycline-resistant marker, 
multiple cloning site and replication origin of pUCl9. 

A par site is introduced for the stability of 



30 
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expression vector in transf ormant and exact separation of 
plasmids after cell division. Plasmid pTC108 is digested 
with EcoRI and Smal. On the other hand, pSClOl is digested 
with Aval to obtain 3.3kb DNA fragment, which is blunt-ended 
5 with Klenow's fragment and then ligated with EcoRI linker 
and followed by digestion with EcoRI. The 3.3kb DNA frag- 
ment containing sticky end of EcoRI site is digested with 
Hindi to obtain 0.37kb DNA fragment containing par site. 
This 0.37kb DNA fragment is ligated to the pTC108 digested 

10 with EcoRI and Smal by T 4 DNA ligase, and the ligated plasmid 
is named pTC226. 

Plasmid pTC226 is digested with AflHI, blunt-ended 
with Klenow fragment and digested with Xbal in a serial 
manner to obtain 2.5kb DNA fragment. On the other hand, 

15 pDE135 is digested with Hindlll, blunt-ended with Klenow 
fragment and digested with Xbal in a serial manner to obtain 
0.45kb DNA fragment. The 2.5kb and 0.45kb DNA fragments are 
ligated with T 4 DNA ligase, and the ligated plasmid is named 
pTE105. 

20 E * coli JM101 is transformed with pTE105, and the 

transf ormant is deposited with Korean Culture Collection of 
Microorganisms (KCCM) located in Department of Pood Engi- 
neering, College of Eng., Yonsei University, Sodaemun-gu, 
Seoul 120-749, Korea on April 9, 1993 in the name of E. coli 

25 DW/BT-2042(KCCM 10027). 

Transformants E. coli DW/BT-2042 are grown in LB media 
and expression of hEGF therefrom is determined by 15% SDS- 
PAGE and Western blot analysis by employing commercially 
available hEGF (Amer sham, ARN 5100, OK) as standard. The 

30 amount of expressed hEGF is determined by hEGF receptor 

binding analysis employing A431 cell line(ATCC CRL 1555). 
Culturing said transf ormant for 30hrs gave 343.5mg/L of 
hEGF, where most of the expressed hEGF was secreted out of 
cytosoi. 

35 The present inventors isolated hEGF from the culture, 

while reducing contamination of other celluar proteins and 
endotoxins, by employing a series of chromatographic 
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purification methods, i.e., Amberchrom CG71 chromatography, 
Q-Sepharose anion exchange chromagraphy and reverse phase C 18 
preparative HPLC. The purity of purified hEGF is deter- 
mined by analytical HPLC in accordance with modified method 
5 of Hayashi et al's[s££: Hayashi, T. et al., Anal. Sci., 3: 
445-449(1987)]. 



The present invention is further illustrated in the 
following examples, which should not be taken to limit the 
10 scope of the invention. 

Example l ; Design of hEGF gene 

The hEGF gene was firstly designed, based on the hEGF 
15 amino acid sequence and Grantham et al's study on the 
frequency of codon usage in E. coliTsee: Grantham et al.. 
Nucleic Acid Res., 9:243-274(1981)]. Then, sequence which 
does not cause formation of secondary structure of mRNA 
transcribed therefrom was selected, while examining whether 
20 the gene contains codons ubiquitous in E. coli and causes 
formation of secondary structure of mRNA or not, by 
employing PC-FOLD (Version 2.0) program ( see s Turner, D. et 
al, Cold Spring Harbor Symp. Quant. Biol., 52:123(1987)]. 
The hEGF sequence thus designed carries the following two 
25 restriction sites to guarantee accurate insertion, isolation 
and manipulation of hEGF gene: Hpal at the 5' terminal; PstI 
at the down stream of translational termination codon. The 
hEGF sequence f uther comprises many unique restriction sites 
at a regular basis as followings: 

30 

HpaI-22bp-Bpull02I-39bp-NsiI-25bp-MluI- 
35bp-Eco47III-21bp-AflII-32bp-PstI 

The hEGF sequence designed in the invention is 
35 disclosed in Fig. l, which shows positions of restriction 
sites: Hpal at the 5' terminal, PstI at the 3' terminal, and 
Bpull02I, Nsil, Mlul, Eco47III and Aflll. 
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Example 2: Synthesis of oligonucleotides 

The hEGF gene designed in Example 1 was chemically 
synthesized. First of all, 10 oligonucleotides consisted of 
5 29mer to 41mer were synthesized in a separate manner. They 
are disclosed in Fig. 2A: in this connection, Cl(30mer), C2 
(35mer), C3(29mer), C4(39mer) and C5(41mer) oligonucleotides 
have the same sequence as that of mRNA transcribed from the 
corresponding hEGF sequence; and, Nl(29mer), N2(38mer), 
10 M3(29mer), N4(36mer) and N5 (38mer) oligonucleotides were 
complementary to the C5, C4, C3, C2 and CI oligonucleotides, 
respectively [fifig: Fig. 2B]. Each oligonucleotide was 
synthesized by automatic Nucleotide Synthesizer (Pharmacia 
1KB Biotechnology, Uppsala, Sweden). 

15 

Example 3: Isolation and sequencing of oligonucleotides 

Oligonucleotides thus synthesized were separated on 
the silica matrix, through treatment with TTD solution 

20 (thiophenol/triethylamine/dioxane = 1/2/2, v/v) and washing 
with methanol, ethanol and strong ammonia water in a serial 
manner. The solution containing oligonucleotide thus 
separated was subjected in strong ammonia water at 50°C for 
12hrs to remove protecting group; and concentration under 

25 vacuum condition was followed to the volume of 0.5ml, 
together with removal of gas. The oligonucleotide solution 
thus concentrated was applied on SEP-PAK cartridge (Waters 
Inc., MA, USA), and elution, was made with acetonitrile/ 
triethylamine solution to obtain partially purified 

30 oligonucleotide. Then, electrophoresis on 15% denatured 
polyacrylamide gel(TE-boric acid (pH 8.3) with 8M urea) was 
carried out and location of oligonucleotide in the gel was 
determined by the irradiation of ultraviolet rays. The gel 
corresponding to oligonucleotide band was cut out, oligo- 

35 nucleotide was eletroeluted and salts were removed on 
SEP-PAK cartridge connected with injector by eluting with 
acetonitrile/triethlamine solution. The oligonucleotides 
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thus isolated were labeled with y-"P-ATP by employing T 4 
polynucleotide kinase (New England Biolabs., #201S, USA) and 
sequenced in accordance with Maxam & Gilbert's method[sss: 
Maxam, A. M. & Gilbert, w., Proc. Natl. Acad. Sci., USA, 
5 74:560-564 (1977)]. 

Example 4; Ligation of oligonucleotides 



10 



15 



20 



lOOpmole of each oligonucleotide prepared in Example 
3, where all the oligonucleotides except for two oligo- 
nucleotides (Cl and Nl) were phosphorylated at 5' terminal, 
were placed in Eppendorf tube with the addition of 40/il 0.1M 
Tris-HCl(pH 7.5). Then, incubation was carried out at 100°C 
for 3min to denature oligonucleotides, and renaturation was 
followed by lowering temperature slowly. To the resultant 
were added lOunits of T 4 dna ligase(New England Biolabs., 
#202S, USA) and ligation buffer solution, and incubated at 
4°C for 12hr and at room temperature for 3hrs, respectively. 
Then, 7% polyacrylamide gel electrophoresis and autoradio- 
graphy were followed to determine the ligated hEGP sequence 
of the invention. 

Examp l e 5 : Construction of pUElls and its transformants 

25 pucia known in the art(s££: Norrander, J. et al, Gene, 

26:101(1985)) was digested with Smal and PstI(New England 
Biolabs., #141S and #140S, ma, USA; all the restriction 
enzymes and linkers described hereinafter were purchased 
from New England Biolabs., MA, USA), and ligated with hEGF 

30 gene of the present invention. Ligation was easily 
performed, based on its nature of blunt 5' terminal and 
sticky 3' terminal in hEGP gene as designed. Recombinant 
vector thus obtained was named pUEllST see : Pig. 3]. 

Then, pUE118 were digested with Hpal, PstI, Bpull021, 

35 Nsil, Mlul, Eco47IIl and Aflli, respectively, and electro- 
phoresis of the gene fragments produced was carried out on 
1% agarose gel. As disclosed in Pig. 4, the insertion of 
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hEGP gene was carried out in an accurate manner and it was 
confirmed that single restriction site exists for each 
restriction enzyme. In Pig. 4, Lane 1 is lkb ladder DNA 
(BRL, USA) ; Lanes 2 and 3 are pDE118 digested with Hpal and 
5 PstI, respectively; Lane 4 is pUE118 digested doubly with 
Hpal and PstI; and, Lanes 5-9 are pDE118 digested with 
Bpull02I, Nsil, Mlul, EC047III and Aflll, respectively. 

Competent E. coli JM109 was transformed with the 
PUE118; and E. coli JM109 thus transformed was deposited 
10 with the Korean Culture Center of Microorganisms (KCCM) in 
the name of Escherichia coli(DW/BT-2040) on April 9, 1993 as 
deposition No. KCCM 10026. 



Examp l e 6 : Synthesis of "Omp A leader-universal translation 
15 termination-Trp A transcription termination 

sequence" 

Oligonucleotide comprising Omp A leader, universal 
translation termination and trp A transcription termination 

20 sequence designed to have several restriction sites as 
disclosed in Fig. 5. The oligonucleotide comprises restric- 
tion sites of BamHI at 5' terminal and Xbal at 3' terminal, 
and Nael and PstI between Omp A leader and universal trans- 
lation termination sequence to assure that N-terminal amino 

25 acid sequence is exactly the same as original protein free 
of additional amino acid sequence. 

First of all, Omp A leader-universal translation 
termination-Trp A transcription termination sequence was 
synthesized as following 8 oligonucleotides of 3imer or 

30 32mer in a separate manner: 

5' GATCCAAAATTATGAAAAA6ACAGCTATC66C 3' 
5' GATTGCA6T66CACT6GCTGGTTTCGCTACC 3' 
5' 6TA6CGCAGGCCGGCCTGCAGCTTAATTAATT 3' 
35 5' AAGCAGCCCGCCTAATGAGCGGGCTTTTTTTT 3' 



5' CTA6AAAAAAAAGCCCGCTCATTAGGCGGGCT 3' 
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5' GCTTAATTAATTAAGCTGCAGGCCGGCCTGCG 3' 
5' CTACGGTAGCGAAACCAGCCAGTGCCACTGC 3' 
5' AATCGCGATAGCTGTCTTTTTCATAATTTTG 3' 

The oligonucleotides were synthesized with automatic 
Nucleotide Synthesizer(Pharmacia LKB Biotechnology, Uppsala, 
Sweden) in accordance with solid-phase phosphate triester 
xaethod[sss: Marang, s. A., Synthesis and Applications of DNA 
and RNA, Academic Press, 1987], 

In an analogy to Example 3, the synthesized oligo- 
nucleotides were separated from the silica matrix; removal 
of protecting group and concentration were followed; 
isolation of oligonucleotides, labeling with y~ m P-ATP, 
denaturation and renaturation, and ligation of oligo^ 
L5 nucleotides were performed. 

Example 7: Construction of plasmid pDT420 

Plasmid pDR540( Pharmacia LKB Biotechnology, #27-4928- 
10 01, Upssala, Sweden) was digested with PvuII(#l032) , ligated 
with Xbal linker and digested with XbaI(#i45S) and BamHI 
(#136S) to obtain 2.4kb DNA fragment. Said fragment was 
isolated from the gel after electrophoresis of digested 
fragments by employing Geneclean II DNA elution kit(BIO 101 
Inc., CA, USA), and ligated with "Omp A leader-universal 
translation termination-Trp A transcription termination 
seguence" prepared in Example 6 by T 4 DNA ligase(#202S) . E. 
coli JM101 was transformed with the ligated DNA according to 
Hanahan's method[sgg: DNA Cloning Vol. I; a Practical 
Approach, IRL Press, 109-135(1985)]. Prom the transfor- 
mants, recombinant plasmid was isolated and sequenced in 
accordance with Maxam and Gilbert's method to screen 
recombinant plasmid containing "OmpA leader-universal 
translation termination-Trp A transcription termination 
seguence". The recombinant plasmid thus selected was named 
pDT420r see : Fig. 6]. 
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Example 8 ; Construction of expression vector pDE135 and 
determination of hEGF expression 



Double digestion on plasmid pDT420 was made with Nael 
5 (#190S) and Pstl(# 140S) to cleave the sequence between Omp 
A leader and universal translation termination sequence. On 
the other hand, plasmid pUE118 prepared in Example 5 was 
digested with HpaI(#105S) and PstI to obtain 0.17kb hEGF 
gene. hEGF gene thus obtained was ligated with pDT420 

10 cleaved with Nael and PstI, and the ligated DNA was 
transformed into E. coli JM101. From the transformants, 
recombinant plasmid was isolated by alkaline lysis 
method [Sambrook et al., Molecular Cloning, a laboratory 
manual, 2nd Ed., Cold Spring Harbor, 1989]. Referring to 

15 PstI and Hindlll restriction enzyme map, recombinant plasmid 
containing hEGF gene between Omp A leader and universal 
translation termination sequence was selected and named 
pDE135[s££: Fig. 7]. 

E. coli JM101 was transformed with pDE135, and the 

20 transformant was cultured in LB media (Luria-Bertani; 
Molecular Cloning/a laboratory manual, 2nd Ed., CSH, 1989). 
Then, expression of hEGF was investigated by SDS-PAGE and 
Western blot analysis. Expression of hEGF was determined by 
receptor binding assay M. W. Rieman, Peptides, 8:877- 

25 885(1987)) employing A431 cell line(ATCC CRL 1555) and 
commercially available hEGF (Amer sham, ARN 5100, UK) as a 
standard. hEGF was expressed with a yield of lOmg/L after 
30hrs cultivation. Under the circumstance, the inventors 
assumed that the grounds of low efficiency of expression 

30 were: the biased transcriptional direction between ampi- 
cillin resistant marker and hEGF gene; competition of 
protein production; and, instability of the expression 
vector in transformant. Accordingly, the inventors 
determined to develop an improved hEGF expression cassette 

35 to solve the above confronted problems by employing 
tetracycline resistant marker whose product was not secreted 
out of cytosol, instead of ampicillin resistant marker. 
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Example 9: Construction of plasmid pTC108 

Plasmid pBR322 (Bolivar, F. et al., Gene, 2:95-113 
(1977)) was digested with AvaI(#152S), and manipulated with 
5 Klenow's fragment (#210S) to have blunt-end. Then, the 
resultant was digested with EcoRI and followed by electro- 
phoresis on 0.8% agarose gel, and the 1.4kb DNA fragment was 
obtained from gel by employing Geneclean II DNA elution kit. 
On the other hand, pUC19(Yanish-Perron, C. , et al., Gene, 

10 33:103-119(1985)) was digested with DraI(#129S) and EcoRI to 
give 1.2kb DNA fragment. The 1.2Jcb DNA fragment has repli- 
cation origin of pUC19 to maintain high efficiency of repli- 
cation and multiple cloning site to ease gene manipulation. 
1.4kb and 1.2kb DNA fragments thus obtained were ligated 

15 with T 4 DNA ligase, and E. coli JM101 was transformed with 
the ligated DNA fragment in accordance with Hanahan's 
method. From the transformants, recombinant plasmid was 
isolated by alkaline lysis. Referring to EcoRI and Afllll 
restriction enzyme map, recombinant plasmid containing 

20 replication origin of pUC19, multiple cloning site and 
tetracycline resistant marker was selected and named 
pTC108r see : Fig. 8]. 

Example 10 £ Construction of plasmid pTC226 

25 

The inventors introduced par site for the stability of 
expression vector in transformant and easy separation of 
plasmids after cell division. Plasmid pSC101( Cohen and 
Chang, Proc. Natl. Acad. Sci., USA, 70:1293-1297(1973); KCTC 

30 11251) was digested with AvaI(#l52S) and electrophoresed on 
0.8% agarose gel; and, 3.3kb DNA fragment was obtained 
therefrom by employing Geneclean II DNA elution kit. DNA 
fragment thus obtained was manipulated with Klenow's 
fragment to produce blunt-end, ligated with EcoRI linker 

35 (#1020) and digested with EcoRI. The 3.3kb DNA fragment 
containing sticky end of EcoRI site was digested with 
Hindi (#103S) and electrophoresed on 0.8% agarose gel. 
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0.37kb DNA fragment containing par site was obtained from 
electrophoresed gel by employing Geneclean II DNA elution 
kit. The obtained DNA fragment was ligated to pTC108 
digested with EcoRI and Smal(#141S) by the aid of T 4 DNA 
5 ligase, and E. coli JM101 was transformed with the ligated 
DNA fragment in accordance with Hanahan's method. From the 
transf ormants , recombinant plasmid was isolated by alkaline 
lysis. Referring to EcoRI and PstI restriction enzyme map, 
recombinant plasmid containing par site was selected and 
10 named pTC226f see : Fig 9]. 

Example lis Construction of expression vector pTE105 

The inventors constructed plasmid vector which is 

15 stable in E. coli and is of high efficiency of hEGF 
expression, by inserting hEGF gene expression cassette of 
the pDE135 into pTC226 to have same transcriptional 
direction. Plasmid pTC226 was digested with Afllll, 
manipulated with Klenow fragment to produce blunt-end, 

20 digested with XbaI(#145S) and electrophoresed on 0.8% 
agarose gel; and, 2.5kb DNA fragment was obtained from 
electrophoresed gel by employing Geneclean II DNA elution 
kit. In the same manner, plasmid pDE135 was digested with 
HindIII(#104S), blunt-ended with Klenow fragment, digested 

25 with Xbal and electrophoresed on 1% agarose gel; and, 0.45kb 
DNA fragment was obtained from electrophoresed gel by 
employing Geneclean II DNA elution kit. Each isolated DNA 
fragment, i.e., 2.5kb and 0.45kb DNA fragments, was ligated 
with T 4 DNA ligase, and E. coli JM101 was transformed with 

30 the ligated DNA fragment in accordance with Hanahan's 
method. From the transf ormant , recombinant plasmid was 
isolated by alkaline lysis. Referring to restriction EcoRI 
and BamHI enzyme map, a recombinant plasmid containing 
tetracycline resistant marker, par site, and Omp A leader 

35 and hEGF gene was selected and named pTE105[see: Fig. 10]. 
E. coli JM101 transformed with pTE105 was named DW/BT-2042, 
and deposited with the Korean Culture Collection of Micro- 
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organisms (KCCM) on April 9, 1993 as deposition No. KCCM 
10027. 

Example 12: Expression of hEGF in transformants 

5 

Transformant E. coli DW/BT-2042(KCCM 10027) was 
cultured in LB media for 5hrs with the addition of isopropyl 
B-D-thio-galactoside(IPTG, Sigma 1-6758) to the final 
concentration of ImM; and, cultures for 19hrs and 25hrs were 

10 collected and centrifuged respectively. The supematants 
were electrophoresed on 15% polyacrylamide gel (see; H. 
Schagger et al., Anal. Biochem., 166:368-379(1987)) em- 
ploying commercially available hEGF (Amer sham, ARN 5100, UK) 
as a standard. Fig. 11A showed that molecular size of hEGF 

15 expressed is determined as 6,000 dalton. Western blot 
analysis(W. N. Burnett, Anal. Biochem., 112:195-203(1981)) 
was also carried out to confirm that the produced was 
hEGF[fifig: Fig. 11B]. In Figs. 11A and 11B, Lane 1 is 
molecular weight marker (Sigma, #HW-SDS-17S) ; Lane 2 is the 

20 standard hEGF; Lane 3 is the culture for 24hrs without 
addition of IPTG; Lane 4 is the culture for 30hrs without 
addition of IPTG; Lane 5 is the culture for 24hrs with 
addition of IPTG ; and, Lane 6 is the culture for 30hrs with 
addition of IPTG. The amount of the hEGF in cultures for 

25 24hrs and 30hrs were analyzed quantitatively by hEGF 
receptor binding assay; and disclosed in Table 1* 

As clearly illustrated in Table 1, it was determined 
that: total hEGF amount of cultivation for 30hrs was 
343.5mg/L and most of expressed hEGF was secreted out of 

30 cytosol. 



Table 1: 



35 



\ Incubation time 

sample \ 


24 hours 


30 hours 


Periplasm 


10.5mg/L 


9 . 4mg/L 


Culture 


213.8mg/L 


334.1mg/L 


Total 


224 . 3mg/L 


343.5mg/L 
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Example 13 : Mass production of hEGF 

Transformant E. coli DW/BT-2042(KCCM 10027) was 
cultured in 4ml of LB media containing 0.5% glucose and 
5 12.5pg/ml of tetracycline at 37°C for llhrs; and, 400/il of 
the culture was inoculated on 100ml of the same media and 
incubated for llhrs to obtain seed culture. 80ml of seed 
culture was inoculated to 2L of media containing bacto- 
trypton log, yeast extract 20g, ¥H^0 4 3g, Na a HP0 4 -8H a 0 4g, 
10 (NH 4 ) a HP0 4 2.5g, CaCl a 2H a 0 O.Olg, Sigma antifoam A lml, 
glucose 5g and tetracycline 5mg per II* , and incubated at 
30*C. After 4hrs incubation, IPTG was added to the final 
concentration of ImM for induction of hEGF and further 
incubation was carried out for 26hrs. 

15 

Example 14 £ Quantitative assay of produced hEGF 

The amount of expressed hEGF was determined by 
receptor binding assay employing A431 cell line(ATCC GRL 

20 1555), which is modified method of Rieman'sf see ; Rieman, 
M.W., Peptides, 8:877-885(1987)] and Pi August ine • s r see 2 
DiAugustine, R. P., J. Biol. Chem., 260:2807-2811(1985)]. 
The A431 cells were mixed with DMEM medium containing calf 
serum, inoculated on Costa 24well cell culture plate to 4 x 

25 10 5 cells/well and incubated under 5% C0 2 -atmosphere at 37°C 
for 6 or 7 days with the change of new medium every other 
day. Then, removal of medium and washing with saline 
phosphate buffer were followed, and cells were immobilized 
by treatment of 10% formaldehyde for lOmin. After the 

30 removal of formaldehyde was carried out by washing with 
saline phosphate buffer, 250/il of receptor binding buffer 
consisting of 1% BSA, 0.2% sodium azide and saline phosphate 
buffer was added to every well. To the above, were added 
standard hEGF solution diluted to 0.01-20ng/20/il or hEGF 

35 solution to be determined and X2S I-EGF(Amersham, IM196, UK) 
diluted to 30,p00cpm/100/il in a serial manner, and 
incubation with shaking at lOOrpm was followed for 2hrs. 
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Then, each well was rinsed with receptor binding buffer and 
bound cells thereon were separated from the well by the 
incubation with 250jil of cell lysis solution consisting of 
0.1N NaOH and 1% SDS. Radioactivity of ^I-EGF bound cells 
5 were determined with the aid of y-scintillation counter 
(Packard Cobra II, USA) . 

Example 152 Purification of hEGF 

10 2L. of culture prepared in Example 13 was centrifuged 

(Sorvall ni RC 28S, USA) at 8,000rpm for 30min. The super- 
natant was loaded on Amberchrom CG71(Tosohass Corp., USA) 
column(2.5 x 40cm) preequilibrated with 20mM Tris buffer(pH 
8.0). Washing was made with 1L of the same buffer; and step 

15 elution was carried out with 50 0ml of 20mM Tris buffer (pH 
8.0) containing 40% acetonitr ile . Plow rate was controlled 
at 60cm/hr in the course of loading, washing and elution. 
The eluted hEGF fraction was stored at 4°C. 

350ml of hEGF fraction obtained as above was loaded on 

20 Q-Sepharose FF(Pharmacia, USA) column(2.5 x 40cm) preequili- 
brated with 20mM Tris buffer(pH 8.0) and washed with 500ml 
of the same buffer. Elution was made with linear gradient 
of 0M to 0.5M NaCl in 20mM Tris buffer(pH 8.0). In this 
connection, flow rate was controlled at 55cm/hr during 

25 loading, washing and elution. 316. 4mg of hEGF was obtained 
with 80 to 85% of purity. 

The hEGF fraction obtained as above was adjusted to pH 
6.5 with 20% phosphate solution, and loaded on reverse 
phase column(Waters Delta Prep 4000 , USA, 8 x 100mm) and 

30 followed by second C^ reverse phase column(8 x 100mm). The 
C^ reverse phase columns employed were preequilibrated with 
lOmM phosphate buffer (pH 6.5)? and, flow rate was controlled 
at 4ml/min. The elution of hEGF was carried out with lOmM 
phosphate buffer(pH 6.5, buffer A) and lOmM phosphate buffer 

35 containing 70% acetonitrile( buffer B). 98% of purity was 
obtained at the {retention time of 29min. Table 2 illust- 
rates HPLC chromatographic conditions for hEGF purification. 
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Retention 
time (min) 


Flow rate 
(ml/min) 


Buffer ratio 
(A/B, %) 


Gradient 
type 


0 


4 


0 / 100 




2 


4 


74 / 26 


linear 


20 


4 


72 / 28 


linear 


45 


4 


70 / 30 


linear 


70 


4 


66 / 44 


linear 


75 


4 


0 / 100 


linear 


85 


4 


0 / 100 


linear 



The homogeneity of purified hEGP in process of purifi- 
cation was determined by SDS-PAGE(Schagger. H. et al., Anal. 
Biochem., 166:368-379(1987) ) [see: Pig. 12]. In Fig. 12, 

15 Lane A is supernatant obtained by centrifugation of culture; 
Lane B is proteins isolated by Amberchrom CG71 chromato- 
graphy; Lane C is proteins isolated by Q-Sepharose FP anion 
exchange chromatography; and, Lane D is purified hEGP by C^ 
reverse phase HPLC. 

20 The purification step of hEGF is summarized in Table 3. 



Table 3: 



25 



Purifica- 
tion 
step 


Total 
volume 
(ml) 


Total 

hEGF 

(mg) 


Total 
protein 
(mg) 


Yield 
(%) 


Specific 
activity 
(mg hEGF/ 
mg protein) 


Culture 


2,000 


582.1 


5,490 


100 


0.106 


Amberchrom 
CG 71 


350 


436.5 


997 


74.9 


0.438 


Q- 

Sepharose 
FP 


300 


316.4 


414 


54.3 


0.764 


C ia HPLC 


240 


282.3 


285 


48.4 


0.990 



35 282. 3mg of purified hEGF was obtained with the yield 

of 48.4%. Protein concentration was determined by Bradford 
method (Bradford, M. , Anal. Biochem., 72:248(1976)) employing 
protein quantitative kit(Biorad, #500-0006, USA). The 
molecular weight of purified hEGF was determined as 6,000 
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dalton by 15% SDS-PAGE analysis f see : Fig. 13]. In Fig. 13 , 
Lane A is low molecular size weight marker (Biorad, #161- 
0304); Lane B is purified hEGF; and, Lane C is peptide 
marker (Sigma, #MW-SDS-17S) . 
5 The purity of purified hEGF was analysed in HPLC and 

represented in Fig. 14. As clearly disclosed in Fig. 14, it 
was determined that: the purity of hEGF was over 98% and 
degradation of C-terminal and oxidation of methionine were 
not occur ed. 

10 Finally, isoelectric focusing analysis was carried out 

for pi value determination in the pH range of 4 to 6 
ampholite(£S£: Carfin, D.E., Methods Enzymol., 183:459-475 
(1990)) and its result was shown in Fig. 15. In Fig. 15, 
Lanes A and D are low calibration isoelectric focusing 

15 analysis marker (Pharmacia, #17-0472-01, USA); Lanes B and C 
are 2pg of purified hEGF; and, Lane E is 4pg of purified 
hEGF. As well be seen in Fig. 15, it was determined that pi 
of the purified hEGF is 4.55, which is the same pi value of 
prior art hEGF. 

20 

Example 16 : Limulus Amebocyte Lysate test 

The purified hEGF was lyophilized and followed by 
Limulus Amebocyte Lysate test (Associates of Cape Code, USA). 
25 The result showed that the amount of endotoxin in hEGF was 
below 0.36EU per Img, which was extremely low. Accordingly, 
hEGF purification process of the invention was determined to 
be very efficient, while reducing contamination of other 
celluar protein and endotoxin. 



35 
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WHAT IS CLAIMED IS; 

1. A human epidermal growth factor gene comprising 
restriction sites of Hpal at 5' terminal and Pstl at 3' 

5 terminal, and Bpull02I, Nsil, Hlul, Eco47III and Aflll, 
whose nucleotide sequence is represented as: 

5' GTT AAC AGC 6AC TCC GAA TGC CCG CTG AGC CAT GAC GGC 
3' CAA TTG TCG CTG AGG CTT ACG GGC GAC TCG GTA CTG CCG 

10 

TAC TGC CTG CAC GAC GGC GTA TGC ATG TAG ATC GAA GCA 
ATG ACG GAC GTG CTG CCG CAT ACG TAC ATC TAG CTT CGT 

CTG GAC AAA TAC GCG TGC AAC TGT GTT GTT GGC TAC ATC 
15 GAC CTG TTT ATG CGC ACG TTG ACA CAA CAA CCG ATG TAG 

GGC GAG CGC TGT CAG TAC CGT GAC CTT AAG TGG TGG GAA 
CCG CTC GCG ACA GTC ATG GCA CTG GAA TTC ACC ACC CTT 

20 CTG CGC TGATAACCTGCA 3' 

GAC GCG ACTATTGG 5' 

2. A recombinant vector pUE118, which comprises the 
nucleotide sequence of claim 1. 

25 

3. E. coli JM109(KCCM 10026) transformed with the 
recombinant vector of claim 2. 

4. An expression vector pTE105 for human epidermal 
30 growth factor comprising an expression cassette consisting 

of Omp A leader sequence, translation termination sequence 
and transcription termination sequence and hEGF gene, 
wherein the expression cassette is under transcriptional 
regulation by tac promoter. 



35 



5. The expression vector pTElOS of claim 4, which 
comprises replication origin and regulation site originated 
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from pUC19. 

6. The expression vector pTE105 of claim 4, which 
comprises par site for its stabilization in E. coli. 

5 

7. The expression vector pTE105 of claim 4, which 
comprises restriction sites of Nael and PstI between Omp A 
leader sequence and universal translation termination 
sequence. 

10 

8. E. coli JM101(KCCH 10027) transformed with the 
expression vector of claim 4. 

9. A process for preparing human epidermal growth 
15 factor from the culture of E. coli JM101(KCCM 10027) of 

claim 8. 

10. A process for preparing human epidermal growth 
factor, which comprises culturing E. coli JM101(KCCM 10027) 
20 of claim 8. 



25 



30 



35 
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Hpa I Bpull02 I 

5* GIT AAC AGC GAC TCC GAA TGC C CG CTC AGC CAT GAC GGC TAC TGC CTG 
3' CAA TO TCG CTG AG6 CTT ACQ GGC GAC TCG GTA CTG CCG ATG ACG GAC 
INSDSECPLSHDGY .CL 



hEGF ,gene 

Nsi I Mlu I 

CAC GAC GGC G TA TGC ATG TAG ATC GAA GCA CTG GAC AAA TAC GCG 
GIG CTG CCG CAT ACG TAC ATC TAG CTT CGT GAC CTG TIT ATG CGC 
H&GVCMY I EALDK YA 



Eco47 III 

JGC AAC TGT GTT GTT GGC TAC ATC GGC GAG CGC TG T CAG TAC CGT 
ACG TTG ACA CAA CAA CCG ATG TAG CCG CTC GCG ACA GTC ATG GCA 
CN'CVV GY I GERCQYR 



Af 1 II Pst I 

GAC CTT AAG TGG TGG GAA CTG CGC TGATAA CCTGCA 3' 
CTG GAA TTC ACC ACC CTT GAC GCG ACTATTGG 5' 
D L K V W E L R (STOP) 



PIG. 1 
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CI r 30 Her ; 5' (riTAACAGCGACTCCGAATGCOCGCTAGC 3 4 

C2 ,35 «er ; 5' CATGAOGGCTACIGCCTQCAOGAOGGOGrATSCAT 3* 

C3 , 29 «er ; 5" CTACATCGAAGCACTGGACAAATACGCGT 3' 

U ,3d mar. I 5' GCAACIGIGI 161 lUUL'lACATCGGOGAGCGCTGICAGT 3' 

C5 ,41 mer t 5' ACCGlXWCCTrMGTGGTGGGMCnXGCTGATMCCTGCA 3' 

Nl ,29..Ber ; 5' GGTTATCAGCGCAGTnXCACCACTTAAG 3* 

N2 ;38 mar I 5' GTCACGGTACTGACAGCGCICGCCGATGTAQCCAACAA 3* 

N3 ,29 «er : 5* CACAffrrGCACGCCTATnCTCCAGTGCT 3* 

N4 ,36 mer i 5* TCGATGTACATGCATACQOOGTQnGCAGGCAGrAG 3* 

N5 /38 mer ; 5' CCGTCATGOTCAGCGGGCATICGGACnnxrrGTTAAC 3' 



FIG- 2A 
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BamH I j Omp A leader sequence- 



s' SBpSAAAATT^ATG AAA AAG ACA GCT ATC GCG ATT GCA GTG 
3* GTTTTAA TAC TTT TIC TGT CGA TAG CGC TAA CGT CAC 



Nae I 



GCA CTG GCT GGT TTC GCT ACC GTA GCG CAG 
CGT GAC CGA CCA AAG CGA TGG CAT CGC GTC 



universal 

1 T terSiiatiSn 1 1 r**P A transcription 
J sequence j j 

ffig CTT AAT TAA TTA AGC AGC COG CCT AAT GAG 

gl|JSg GAA TTA ATT AAT TCG ICG GGC GGA TTA CTC 



termination sequence} ^ba I 

CGG OCT TIT TIT $ 3* 
GCC CGA AAA AAA SS^BC 5' 



FIG- 5 
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Hindlll 



BamHI 




Trp A transcription 

termination sequence 

universal translation 
termination sequence 



Omp A leader^ sequence 



X 



1 



BamHI 



Nael PstI 




P villi 

ligat^ with Xbal linker 
Xbal BamHX 
I 

2.4kb DNA fragment 



ligation 
I 

BamHI NaeI 
Hindlll— ^ | /ll^pstl 

•Xbal 




FIG. 6 
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BamHI NaeI 




0.1 7kb DNA fragment 



ligation 




FIG. 7 
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ligate with EcoRI linker 
EcoRI, Hindi 
0.37kb DNA fragment 



ligation 
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Hindlll BamHI 
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FIG. 11A 
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